NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Algorithmic Collective Action in Machine Learning

Hardt, Moritz; Mazumdar, Eric; Mendler-Dunner, Celestine; Zrnic, Tijana (July 2023, Proceedings of Machine Learning Research)
Alternative Microfoundations for Strategic Classification

Jagadeesan, Meena; Mendler-Dünner, Celestine; Hardt, Moritz (July 2021, Proceedings of the 38th International Conference on Machine Learning, PMLR 139:4687-4697, 2021)

Full Text Available
From Optimizing Engagement to Measuring Value

https://doi.org/10.1145/3442188.3445933

Milli, Smitha; Belli, Luca; Hardt, Moritz (April 2021, Proceedings of ACM FAccT 2021)

Full Text Available
Retiring Adult: New Datasets for Fair Machine Learning

Ding, Frances; Hardt, Moritz; Miller, John; Schmidt, Ludwig (January 2021, Advances in Neural Information Processing Systems 34 (NeurIPS 2021))

Full Text Available
Revisiting Design Choices in Proximal Policy Optimization

Hsu, Chloe Ching-Yun; Mendler-Dünner, Celestine; Hardt, Moritz (September 2020, ArXivorg)
null (Ed.)
Proximal Policy Optimization (PPO) is a popular deep policy gradient algorithm. In standard implementations, PPO regularizes policy updates with clipped probability ratios, and parameterizes policies with either continuous Gaussian distributions or discrete Softmax distributions. These design choices are widely accepted, and motivated by empirical performance comparisons on MuJoCo and Atari benchmarks. We revisit these practices outside the regime of current benchmarks, and expose three failure modes of standard PPO. We explain why standard design choices are problematic in these cases, and show that alternative choices of surrogate objectives and policy parameterizations can prevent the failure modes. We hope that our work serves as a reminder that many algorithmic design choices in reinforcement learning are tied to specific simulation environments. We should not implicitly accept these choices as a standard part of a more general algorithm.
more » « less
Full Text Available
Strategic Classification is Causal Modeling in Disguise

Miller, John; Milli, Smitha; Hardt, Moritz (April 2020, ICML 2020)
null (Ed.)
Consequential decision-making incentivizes individuals to strategically adapt their behavior to the specifics of the decision rule. While a long line of work has viewed strategic adaptation as gaming and attempted to mitigate its effects, recent work has instead sought to design classifiers that incentivize individuals to improve a desired quality. Key to both accounts is a cost function that dictates which adaptations are rational to undertake. In this work, we develop a causal framework for strategic adaptation. Our causal perspective clearly distinguishes between gaming and improvement and reveals an important obstacle to incentive design. We prove any procedure for designing classifiers that incentivize improvement must inevitably solve a non-trivial causal inference problem. Moreover, we show a similar result holds for designing cost functions that satisfy the requirements of previous work. With the benefit of hindsight, our results show much of the prior work on strategic classification is causal modeling in disguise.
more » « less
Full Text Available
Performative Prediction

Perdomo, Juan; Zrnic, Tijana; Mendler-Dünner, Celestine; Hardt, Moritz (July 2020, International Conference on Machine Learning (PMLR))
null (Ed.)
When predictions support decisions they may influence the outcome they aim to predict. We call such predictions performative; the prediction influences the target. Performativity is a well-studied phenomenon in policy-making that has so far been neglected in supervised learning. When ignored, performativity surfaces as undesirable distribution shift, routinely addressed with retraining. We develop a risk minimization framework for performative prediction bringing together concepts from statistics, game theory, and causality. A conceptual novelty is an equilibrium notion we call performative stability. Performative stability implies that the predictions are calibrated not against past outcomes, but against the future outcomes that manifest from acting on the prediction. Our main results are necessary and sufficient conditions for the convergence of retraining to a performatively stable point of nearly minimal loss. In full generality, performative prediction strictly subsumes the setting known as strategic classification. We thus also give the first sufficient conditions for retraining to overcome strategic feedback effects.
more » « less
Full Text Available
Stochastic Optimization for Performative Prediction

Mendler-Dünner, Celestine; Perdomo, Juan C.; Zrnic, Tijana; Hardt, Moritz (July 2020, Advances in Neural Information Processing Systems 33 (NeurIPS 2020))
null (Ed.)
In performative prediction, the choice of a model influences the distribution of future data, typically through actions taken based on the model's predictions. We initiate the study of stochastic optimization for performative prediction. What sets this setting apart from traditional stochastic optimization is the difference between merely updating model parameters and deploying the new model. The latter triggers a shift in the distribution that affects future data, while the former keeps the distribution as is. Assuming smoothness and strong convexity, we prove rates of convergence for both greedily deploying models after each stochastic update (greedy deploy) as well as for taking several updates before redeploying (lazy deploy). In both cases, our bounds smoothly recover the optimal O(1/k) rate as the strength of performativity decreases. Furthermore, they illustrate how depending on the strength of performative effects, there exists a regime where either approach outperforms the other. We experimentally explore the trade-off on both synthetic data and a strategic classification simulator.
more » « less
Full Text Available
Stable Recurrent Models

Miller, John; Hardt, Moritz (April 2019, In Proceedings of ICLR 2019)

Full Text Available
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

Sun, Yu; Wang, Xiaolong; Liu, Zhuang; Miller, John; Efros, Alexei A.; Hardt, Moritz (April 2020, ICML 2020)
null (Ed.)
In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised learning problem, on which we update the model parameters before making a prediction. This also extends naturally to data in an online stream. Our simple approach leads to improvements on diverse image classification benchmarks aimed at evaluating robustness to distribution shifts.
more » « less
Full Text Available

« Prev Next »

Search for: All records